computer vision pipeline
CLAIRE-DSA: Fluoroscopic Image Classification for Quality Assurance of Computer Vision Pipelines in Acute Ischemic Stroke
Berg, Cristo J. van den, Nijenhuis, Frank G. te, Blaauboer, Mirre J., van Erp, Daan T. W., Keppels, Carlijn M., van der Sluijs, Matthijs, Roozenbeek, Bob, van Zwam, Wim, Cornelissen, Sandra, Ruijters, Danny, Su, Ruisheng, van Walsum, Theo
Computer vision models can be used to assist during mechanical thrombectomy (MT) for acute ischemic stroke (AIS), but poor image quality often degrades performance. This work presents CLAIRE-DSA, a deep learning--based framework designed to categorize key image properties in minimum intensity projections (MinIPs) acquired during MT for AIS, supporting downstream quality control and workflow optimization. CLAIRE-DSA uses pre-trained ResNet backbone models, fine-tuned to predict nine image properties (e.g., presence of contrast, projection angle, motion artefact severity). Separate classifiers were trained on an annotated dataset containing $1,758$ fluoroscopic MinIPs. The model achieved excellent performance on all labels, with ROC-AUC ranging from $0.91$ to $0.98$, and precision ranging from $0.70$ to $1.00$. The ability of CLAIRE-DSA to identify suitable images was evaluated on a segmentation task by filtering poor quality images and comparing segmentation performance on filtered and unfiltered datasets. Segmentation success rate increased from $42%$ to $69%$, $p < 0.001$. CLAIRE-DSA demonstrates strong potential as an automated tool for accurately classifying image properties in DSA series of acute ischemic stroke patients, supporting image annotation and quality control in clinical and research applications. Source code is available at https://gitlab.com/icai-stroke-lab/wp3_neurointerventional_ai/claire-dsa.
Pac-Man Pete: An extensible framework for building AI in VEX Robotics
Zietek, Jacob, Wade, Nicholas, Roberts, Cole, Malek, Aref, Pylla, Manish, Xu, Will, Patil, Sagar
We identify and develop three separate critical components. This includes a Unity simulation and reinforcement learning model training pipeline, a malleable computer vision pipeline, and a data transfer pipeline to offload large computations from the VEX V5 Brain/micro-controller to an external computer. We give the community access to all of these components in hopes they can reuse and improve upon them in the future, and that it'll spark new ideas for autonomy as well as the necessary infrastructure and programs for AI in educational robotics.
Computer Vision Pipeline with Kubernetes
We produce a multitude of attributes (characteristics attached to an entity -- building, parcel, etc.) using various sources such as aerial imagery. The idea is to build Deep Learning models from a few thousand buildings using in-house-tagged labels or existing labels from open data. In a second step, the models are deployed on the whole French territory, which represents more than 35 million images to process (i.e. 4 TB of data to deal with). This second step is the focus of this post. The challenge is to be able to infer at low cost and in a short amount of time, (less than a day).
Geometry-Based Grasping of Vine Tomatoes
de Haan, Taeke, Kulkarni, Padmaja, Babuska, Robert
We propose a geometry-based grasping method for vine tomatoes. It relies on a computer-vision pipeline to identify the required geometric features of the tomatoes and of the truss stem. The grasping method then uses a geometric model of the robotic hand and the truss to determine a suitable grasping location on the stem. This approach allows for grasping tomato trusses without requiring delicate contact sensors or complex mechanistic models and under minimal risk of damaging the tomatoes. Lab experiments were conducted to validate the proposed methods, using an RGB-D camera and a low-cost robotic manipulator. The success rate was 83% to 92%, depending on the type of truss.
Computer Vision: A Key Concept to Solve Many Image Data Problems
This article was published as a part of the Data Science Blogathon. Computer Vision is evolving from the emerging stage and the result is incredibly useful in various applications. It is in our mobile phone cameras which are able to recognize faces. It is available in self-driving cars to recognize traffic signals, signs, and pedestrians. Also, it is in industrial robots to monitor problems and navigating around co-workers.
Bringing together IoT, Computer Vision, and Machine Learning
To intelligently connect many pieces of legacy infrastructure to the internet, you'll need to do it in such a way insights can be gleaned from the all data it generated. The premise sounds simple enough, but in practice the project would require deep knowledge of a wide range of technologies. By breaking the problem down into discrete, logical pieces, we were able to prove that a working solution was possible. Take for example, an application involving an IoT device, computer vision, and machine learning. The first step required is determining the best approach to retrofitting the legacy infrastructure.